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Abstract: Coronaviruses (CoVs), a genus containing about 26 known species to date, cause highly prevalent diseases and 
are often severe or fatal in humans and animals. In 2003, a previously unknown coronavirus was identified to be the 
etiological agent of a global outbreak of a form of life-threatening pneumonia called severe acute respiratory syndrome 
(SARS). No efficacious therapy is currently available, and vaccines and drugs are under development to prevent SARS- 
CoV infection in many countries. The CoV main protease (M pro ), which plays a pivotal role in viral gene expression and 
replication through a highly complex cascade involving the proteolytic processing of replicase polyproteins, is an attrac¬ 
tive target for drug design. This review summarizes the recent advances in biological and structural studies, together with 
development of inhibitors targeting CoV M pro s. It is expected that inhibitors targeting CoV M pro s could be developed into 
wide-spectrum antiviral drugs against existing and possible future emerging CoV-associated diseases. 
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1. INTRODUCTION 

Coronaviruses, characterized as enveloped, positive- 
stranded RNA viruses with the largest known genome, be¬ 
long to the genus Coronavirus of the family Coronaviridae 
[1, 2], There are approximately 26 species of coronaviruses 
(CoVs) [3-6] which can be classified into three distinct 
groups according to their genome sequences and serological 
reactions [2]. They infect humans and multiple species of 
animals, causing a variety of highly prevalent and severe 
diseases. For example, human coronavirus (HCoV) strains 
229E (HCoV-229E), NL63 (HCoV-NL63), OC43 (HCoV- 
OC43), and HKU1 (HCoV-HKUl) are responsible for a 
significant portion of upper and lower respiratory tract in¬ 
fections in humans, including common colds, bronchiolitis, 
and pneumonia. They have also been implicated in otitis 
media, exacerbations of asthma, diarrhea, myocarditis, and 
neurological disease [1, 7-11], In 2003, a previously un¬ 
known HCoV called severe acute respiratory syndrome 
coronavirus (SARS-CoV), the emergence of which was 
most likely as a result of animal-human transmission [12], 
was identified as the etiological agent of a global outbreak 
of a life-threatening form of pneumonia called severe acute 
respiratory syndrome (SARS) [13-16]. SARS ultimately 
infected more than 8000 people and killed approximately 
800 people worldwide, with about 10% mortality rate, be¬ 
fore it was effectively brought under control. From the be¬ 
ginning of the SARS epidemic, efforts to identify the natural 
host for SARS-CoV have never ceased. Recently two inde¬ 
pendent research groups have discovered a coronavirus 
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closely related to SARS-CoV in bats, which has prompted a 
new health alert [5, 6], Animal coronaviruses, including 
avian infectious bronchitis virus (IBV), turkey coronavirus 
(TCV), porcine transmissible gastroenteritis vims (TGEV), 
porcine hemagglutinating encephalomyelitis vims (HEV), 
canine coronavims (CCoV), feline infectious peritonitis 
vims (FIPV) and bovine coronavirus (BCV), are significant 
pathogens for chickens, turkeys, pigs, dogs, cats and cattle 
[2, 17], Most of the known coronaviruses are highly infec¬ 
tious with high mortality in young animals, resulting in sig¬ 
nificant economic losses for the animal industry worldwide. 

Although vaccines against some animal coronaviruses, 
such as IBV and CCoV, are routinely used to prevent seri¬ 
ous diseases, there are several potential problems [1, 3, 11], 
At present, there are no licensed vaccines or specific drugs 
are available to prevent HCoV infection [1, 11], Due to lack 
of efficacious therapies, the mortality rate was high during 
the SARS outbreak. Consequently, great efforts have been 
focused on the development of vaccines and dmgs against 
SARS-CoV. Viral-vectored vaccines [18-20] and DNA vac¬ 
cines [21-23] have been tested in animal models with suc¬ 
cessful results, and currently an inactivated SARS-CoV 
vaccine is in clinical trials in China [24]. However, safety 
remains the major concern. Drug development strategies are 
focused on two main avenues: inhibitors to block vims entry 
into the host cells, and compounds to prevent viral replica¬ 
tion and transcription. The CoV main protease (M pr °), which 
plays a pivotal role in mediating viral replication and tran¬ 
scription, is a particularly attractive target for anti-SARS 
dmg design [25, 26]. Several reviews published previously 
have touched on the topic of coronavims main proteases 
[25-27]. In this review, recent advances in biological and 
structural studies, and particularly in development of in¬ 
hibitors of the CoV M plo s, will be summarized and dis¬ 
cussed. 

© 2006 Bentham Science Publishers Ltd. 
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2 . STRUCTURE AND FUNCTION OF CORONA- 
VIRUS M pro 

2.1. Overview 

As the largest RNA viruses known to date, the size of 
coronavirus genomes range from 27 to 31 kb [5, 7, 8, 28- 
40]. About two-thirds of the genome encodes two overlap¬ 
ping polyproteins, ppla (450-500 kDa) and pplab (750-800 
kDa), which undergo extensive proteolytic processing by 
viral proteases to produce multiple functional subunits. 

These functional subunits are involved in formation of the 
replicase complex to carry out viral replication and tran¬ 
scription [25, 26, 41]. The viral proteases mentioned above 
are classified into accessory proteases and the main protease 
(M pr °, also called 3C-like protease). Accessory proteases are 
papain-like cysteine proteases which cleave the N-proximal 
polyproteins regions at two or three limited sites [25, 26]. 
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The main protease is a chymotrypsin-like cysteine protease 
(~33 kDa) [42, 43], which not only processes at its own 
flanking sites within the polyproteins, but also directs the 
processing of all downstream domains of the replicase poly¬ 
proteins via at least 11 conserved cleavage sites [25, 26, 44] 
(see Fig. 1). It is termed the main protease because of its 
dominant role in processing replicase polyproteins and gene 
expression. The alternative name of 3C-like protease was 
designated after the picomavirus 3C proteases because of 
the similar substrate specificities and the identification of 
cysteine as a catalytic residue in the context of a predicted 
two-[3-barrel structure [45, 46]. 

The existence of the coronavirus M pr ° was originally pre¬ 
dicted by sequence analysis of IBV replicase polyprotein in 
1989 [46]. Although comparative sequence analysis was 
subsequently extended to include the replicase genes of 
MHV, HCoV-229E and TGEV [36, 38, 39], the first ex- 
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Fig. (1). The domain organization and proteolytic processing of coronavirus replicase polyproteins from human coronavirus 229E (HCoV- 
229E), murine hepatitis virus (MHV), SARS coronavirus (SARS-CoV) and avian infectious bronchitis virus (IBV). Shown are the replicase 
polyproteins ppla and pplab; the processing products of ppla are termed non-structural proteins (nsp) 1-11, and those of pplab are termed 
nspl-nsplO and nspl2-nspl6. Cleavage sites for the main protease (M pro ) are marked in orange arrows; cleavage sites for the papain-like 
cysteine protease 1 (PLl pro ) are marked in yellow arrows; and cleavage sites for the papain-like cysteine protease 2 (PL2 pro ) are marked in 
blue arrows. Key to abbreviations: Ac, acidic domain; PLl pro , papain-like cysteine protease 1; A, X domain with adenosine diphosphate - 
ribose 1 "-phosphatase activity; SUD, SARS-CoV unique domain; PL2 pm , papain-like cysteine protease 2; Y, Y domain containing a trans¬ 
membrane domain and a putative Cys/His-rich metal binding domain; TM1 , TM2, TM3, transmembrane domains 1, 2, 3; M pm , main protease 
(or 3C-like protease); RdRp, RNA-dependent RNA polymerase; HEL, helicase; ExoN, 3'-to-5' exonuclease; Xendoll, poly(U)-specific en- 
doribonuclease; MT, S-adenosylmethionine-dependent ribose 2'-0-methyltransferase; C/H, Cys/His-rich domains predicted to bind metal 
ions. IBV ppla and pplab do not possess a counterpart to nspl of other coronaviruses. The PLl pro of IBV is crossed out to indicate that it is 
proteolytically inactive. Figure adapted from [25]. 
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perimental evidence of protease activity was reported for 
IBV in 1994 [47]. Afterwards, extensive activity studies of 
M pl ° in different expression systems were extended to MHV 
and HCoV [48, 49]. The information accumulated from 
different coronaviruses can be used to map the M pro proc¬ 
essing of replicase polyproteins from all three coronavirus 
groups [25] (see Fig. 1). 


although SARS-CoV M pl ° shares a slightly higher sequence 
identity with TGEV M p '° (44 %) than with the HCoV 229E 
enzyme (40 %), the structure is significantly more similar to 
that of the latter (r.m.s. deviations 2.3 A and 1.6 A, respec¬ 
tively), which explains why FICoV 229E M pl ° is a better 
search model in determining the structure of SARS-CoV 
M pl ° by molecular replacement. 


2.2. Overall Structure 

The crystal structure of TGEV M pr ° at 1.96 A, which was 
the first structure of any M pl ° to be solved, was reported in 
2002 [43]. During the SARS outbreak and shortly after the 
SARS epidemic in 2003, the structures of HCoV 229E M pl ° 
and SARS-CoV M pr ° were respectively solved at 2.54 A and 
1.9 A [42, 50]. Later, several structures for SARS-CoV M pl ° 
were published one after the other with the aim of drug de¬ 
velopment [51-53]. TGEV and HCoV 229E both belong to 
group 1 [2]. SARS-CoV is considered to be an early split-off 
from group 2 and has been classified as a group 2b CoV [44, 
54], HCoV 229E M pro and TGEV M pro share 61 % primary 
sequence identity, whereas SARS-CoV M pr ° has only about 
40 % sequence identity with these two M plo s (see Fig. 2). 
Although the three M pro s are from two different groups, they 
share a similar overall structure. It is interesting to find that 


All three main proteases are comprised of three domains 
(see Fig. 3A and B). Domains I (TGEV M pr °: residues 8- 
100; HCoV M pr °: 8-99; SARS-CoV M pro : 8-101) and II 
(TGEV M pr °: residues 101-183; HCoV M pro : 100-183; 
SARS-CoV M pr °: 102-184) both have an anti-parallel 13- 
barrel fold, which is similar to the serine proteases of the 
chymotrypsin family. These two domains have very limited 
similarity with picomavirus 3C proteases [55-58], which 
also have a chymotrypsin-related structure, suggesting that 
the name main protease is more appropriate than the some¬ 
what misleading 3C-like protease. The superposition of 
domains 1 and II of the TGEV M p ‘° onto the equivalent do¬ 
mains of the HAV 3C protease yields an r.m.s.d. of 2 A for 
114 equivalent (out of 184 compared) C a pairs [43]. The 
substrate-binding site is located in a cleft formed between 
domains I and II and the catalytic site lies at the center of the 
cleft. In contrast to picornavirus 3C proteases, there is an 
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Fig. (2). Multiple sequence alignment of coronavirus M pro s from human coronavirus 229E (HCoV-229E), murine hepatitis virus (MHV), 
SARS coronavirus (SARS-CoV) and avian infectious bronchitis virus (IBV). The catalytic dyad (41-His and 144/145-Cys) is shown by blue 


triangles. 
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Fig. (3). A). The structure of the porcine transmissible gastroenteritis virus (TGEV) M pro monomer. The M pro monomer structure is drawn in 
ribbon representation and colored from blue at the N-terminal to red at the C-terminal. The catalytic dyad (41-His and 144-Cys) are shown as 
sticks and domains I. II and III are marked. B). Superposition of coronavirus M pro s from human coronavirus 229E (HCoV-229E; blue), mur¬ 
ine hepatitis virus (MHV; yellow), SARS coronavirus (SARS-CoV; red) and avian infectious bronchitis virus (IBV; green). It should be 
noted that the refinement of IBV M pro is still in progress and the MHV M pro structure is a homology model. Please see [3] for further details. 


additional domain III in CoV M pro s with a unique topology 
[43]. Domain III (TGEV M pro : residues 200-302; HCoV 
M pro : 200-300; SARS-CoV M pro : 201-303) contains five 
largely anti-parallel a-helices arranged into a globular clus¬ 
ter, and is connected with domain II via a long loop region 
consisting of 16 amino acids. The N-terminal within domain 
I folds onto domain III, bringing it into close proximity with 
the C-terminal of domain III. 

CoV M pr ° can form a homodimer both in the crystal and 
in solution (see Fig. 4). In the crystal, one protomer is ori¬ 
ented perpendicular to the other and the solvent accessible 
surface area (per protomer) buried upon dimerization of 
each protomer ranges from 1300-1600 A 2 . In each pro¬ 
tomer, domains II, III and the “N-finger” are involved in 
dimer interface formation. The “N-finger”, defined as the N- 
terminal residues 1 -7, inserts between domains II and III of 
the partner subunit [50]. Dynamic light scattering results 
show that both HCoV 229E and TGEV M plo s exist as a 
mixture of monomers (-65%) and dimers (-35%) in diluted 
solutions (1-2 mg/ml). Analytical gel filtration, ultracentri¬ 
fugation and isothermal titration calorimeter experiments 
both indicate that SARS-CoV M pr ° is also a mixture of 
monomers and dimers in solution [59-61], although the val¬ 
ues of the dissociation constant (K d ) determined by the dif¬ 


ferent methods are incongruent, ranging from JI.M to nM 
[51, 59, 62, 63]. While the N-terminal and domain III are 
observed to participate in dimerization interactions in the 
crystal structure, the role of the N-terminal in maintaining 
the quaternary structure has been reported with inconsistent 
results. In their SARS-CoV M pl ° study, Chou et al. showed 
that an ion pair formed by 4-Arg at the N-terminal of one 
protomer and 290-Glu at the C-terminal of domain III in its 
partner is crucial in dimer interactions. Mutating either of 
these two residues will increase the dissociation constant 
[62]. Another experiment demonstrated that deletion of 1-3 
amino acids from the N-terminal will not have a substantial 
effect on dimerization, whereas the truncation of amino 
acids 1-4 shows a major form of a monomer [63]. However, 
Shi et al. reported that after the dissection of SARS-CoV 
M pl ° into two parts, the chymotrypsin fold containing do¬ 
mains I and II is a monomer while the additional domain III 
exists as a stable dimer [64]. Chen et al. reported that K d of a 
N-terminal (1-7 amino acids) truncated SARS-CoV M pl ° 
changes slightly compared with the full-length protease by 
isothermal titration calorimeter analysis, suggesting that the 
N-terminal should not be indispensable for dimerization 
[61]. Nonetheless, it is generally accepted that domain III is 
essential to maintain quaternary structure and the “N-finger” 
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Fig. (4). The SARS-CoV M pro in complex with a CMK inhibitor, as reported by Yang et al. [501. The M pro is shown in ribbon representation 
with protomer A in red and protomer B in blue, and is covered by a transparent molecular surface. The CMK inhibitors bound to protomers 
A and B are shown in yellow stick representation. 


is important for enzymatic activity, the latter of which will 
be discussed below. 

Another question is whether the role of domain III is 
only limited to stabilization of the dimer, thus maintaining 
the activity of the M pr °, as has been verified by loss of activ¬ 
ity following deletion of part or whole of domain III in 
TGEV and SARS-CoV M pro s [43, 60, 63, 65]. In characteri¬ 
zation experiments of temperature-sensitive MHV mutants, 
substitution of 291-Phe in domain III of M pl ° with Leu 
causes an RNA-minus phenotype at the restrictive tempera¬ 
ture [66]. In the MHV M pl ° homology model constructed 
from SARS-CoV M pro [3], 291-Phe is located far from the 
active site and does not participate in dimerization, and so 
there is little probability of this mutation independently 
affecting the protease activity. As 291-Phe is situated near 
the protease surface, it has the opportunity to interact with 
other domains in polyproteins. One possible role for domain 
III might be to regulate the proleolytic processing pathways 
for the polyproteins through interaction of specific regions 
of downstream domains of the replicase polyproteins, al¬ 
though we can not exclude other functions of domain III in 
RNA synthesis. 

2.3. Catalytic Site and Substrate Binding Pocket 

It was originally proposed from IBV sequence analysis 
that the catalytic center of CoV M pl ° resembles that of other 
viral 3C and 3C-like proteases and might possibly include a 
His-Asp(Glu)-Cys catalytic triad [46]. Mutagenesis studies 
have confirmed that His and Cys residues are essential com¬ 


ponents of the catalytic centre in HCoV 229E, IBV, MHV 
and SARS-CoV M pro s [48, 49, 51, 67-71], There has been a 
long-stranding debate on the existence of a third member in 
the catalytic center of CoV M pr °s. Great efforts made in 
sequence analysis and mutagenesis studies could not com¬ 
pletely solve this question [26]. Only when the crystal 
structure of TGEV M pl ° was determined in 2002 was this 
problem finally clarified [43]. Compared with serine prote¬ 
ases and other cysteine proteases, which adopt a catalytic 
triad, CoV M pl ° only takes His-Cys as a catalytic dyad and 
completely lacks a third catalytic residue [3, 42, 43, 50]. In 
contrast to picornavirus 3C proteases, TGEV, HCoV 229E 
and SARS-CoV M pr °s all have a particularly ordered water 
molecule in the corresponding position to the third catalytic 
member. This water forms at least three hydrogen bond 
interactions with surrounding residues (including the cata¬ 
lytic dyad member His) in all M pl ° structures determined to 
date, suggesting its possible role is to stabilize the proto- 
nated histidine in the intermediate state during proteolytic 
cleavage, reminiscent of the function of Asp (Glu) in classic 
serine protease catalysis. The replacement of cysteine in the 
charge-relay system with a serine in IBV and SARS-CoV 
will produce a protease with residual activity, supporting a 
classical general base mechanism related to serine proteases 
rather than a thiolate-imidazolium mechanism for papain¬ 
like proteases [69, 71]. 

Coronaviruses only encode one M pl °, which is highly 
selective for substrates. A typical substrate sequence for 
M pl ° is Leu-Glnl(Ser, Ala, Gly) in polyproteins, which is 
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conserved among different coronavirus M pr °s [26]. Hegyi et 
al. discovered that a synthetic peptide representing the N- 
terminal HCoV 229E M pro cleavage site was shown to be 
effectively hydrolysed by non-cognate M plo s [72]. In studies 
on SARS-CoV M pr °, it was reported that the N-terminal 
SARS-CoV M pr ° processing sequence could be efficiently 
cleaved by M pl °s of all other groups of CoV, including the 
recently identified HCoV-NL63 and HCoV-HKUl [3, 42, 

73] . This is explained by a comparison of representatives 
from all three groups of CoVs: the structures of TGEV 
(group 1), HCoV-229E (group 1), SARS-CoV (group 2b) 
M pl0 s, a homology model of MHV (group 2a) M p '° and the 
structure of IBV M p '° under refinement [3, 27, 42, 43, 50]. 

Superposition of the structures and model reveal that 
backbones of the CoV M plo s substrate binding pocket su¬ 
perimpose particularly well (see Fig. 3B), except for a small 
segment located on the outer wall of S2 [3], and so attention 
is focused on the variation of side chains forming important 
subsites SI, S2, S4 and SI'. In coronavirus M plo s, the SI 
subsite endows the peptidases with unique specificity for 
substrate recognition and PI is invariably occupied by Gin 
in polyprotein processing. Anand and colleagues reported 
the structure of TGEV M pro in complex with a substrate 
analog, hexapeptidyl chloromethyl ketone inhibitor, Cbz- 
Val-Asn-Ser-Thr-Leu-Gln-CMK, derived from the N- 
terminal processing cleavage site [42]. This structure re¬ 
vealed that the side chains of 165-Glu, 162-His, 171-His, 
and 139-Phe in TGEV M pl ° (also conserved in other M pr °s) 
are incorporated with other backbone elements to constitute 
the SI site, which has an absolute requirement for Gin at the 
PI position via two hydrogen bonds. One hydrogen bond is 
formed by the side chain of Gin and the Ne atom of His-162, 
which is situated at the bottom of SI subsite. The required 
neutral state of His-162 at physiological pH appears to be 
maintained by two important interactions: (i) stacking with 
the phenyl ring of Phe-139 and (ii) acceptance of a hydrogen 
bond from the hydroxyl group of the buried Tyr-160, which 
is conserved in other known SARS-CoV and HCoV 229E 
M pro structures [42, 50-52], The role of His-162 and Tyr-160 
was experimentally verified by mutagenesis analysis of 
corresponding residues in HCoV 229E and FIPV M plo s [68, 

74] . Inserting in between domain III of the parent protomer 
and domain II of the neighboring protomer in the dimer, the 
N-finger plays an important role for the protease activity. In 
particular, the first amino acid of the N-finger in the parent 
protomer commonly forms ionic or hydrogen bond interac¬ 
tions with the essential residues constituting SI site of the 
partner protomer, helping to maintain the active conforma¬ 
tion of this subsite [43, 50, 52]. Deletion of residues 1-5 in 
the related TGEV M p ‘° renders the enzyme almost com¬ 
pletely inactive [43]. Site-direct mutation of the crucial resi¬ 
due or deletion of the whole N-finger will result in a great 
loss of activity in SARS-CoV M pro [61-63, 65]. 

The side chains of 164-Leu, 51 -lie, 41-His, and 53-Tyr, 
as well as the alkyl portion of side chains of 186-Asp and 
47-Thr, are involved in forming a deep hydrophobic S2 
subsite that can accommodate the relatively large side chain 
of Leu in TGEV M pr °. This same feature can also be ob¬ 
served in the HCoV-229E M pr °. Several conservative sub¬ 
stitutions occur in other CoV M pr °s (164-Leu —> 165-Met in 
SARS-CoV and MHV M pro s; 53-Tyr ^ 50-Trp in IBV 


M pl °). Another minor difference is observed in SARS-CoV 
and MHV M pr °s, where the outer wall segment is composed 
of a short 3io-helix from residues 45-50, in contrast to the 
less regular structure in HCoV and TGEV M pr °s. With re¬ 
spect to the structure of IBV M pro undergoing refinement, no 
cleai' electron density was observed in the corresponding 
stretch of residues 44-47. Apart from a few exceptions, CoV 
M p '° has a Leu residue in the P2 position [26]. However. 
SARS-CoV M pl ° has Phe at the P2 site in its C-terminal 
cleavage sequence, representing a structural difference in the 
S2 site'compared with TGEV and HCoV M plo s [51, 52], The 
side chain of the residue at the P3 position in all M plo s is 
solvent-exposed, so this site was expected to tolerate a wide 
range of functionality. The side chains of 164-Leu, 166-Leu, 
184-Tyr, and 191-Gln that form the S4 hydrophobic subsite 
of TGEV are conserved in other CoV M pro s, with the excep¬ 
tion of the following conservative substitutions: 184-Tyr —> 

184- Phe in HCoV M pl °; 164-Leu 165-Met, 184-Tyr 

185- Phe in SARS-CoV. The congested S4 site implies that 
only small amino acid residues such as Ser, Thr, Val or Pro 
can be accommodated at the P4 position [27]. Although the 
SL subsite of TGEV M pl ° formed by 27-Leu, 41-His and 
47-Thr is not deep, similar with other CoV M plo s, it has 
sufficient space for common small PL residues such as Ser, 
Ala, or Gly to extend into this subsite. This could help in the 
design of substrate-analogue inhibitors targeting CoV M plo s 
with higher binding affinity [3]. Yang and colleagues have 
solved the structure of SARS-CoV M p '° mutant in complex 
with an 11-mer peptidyl substrate, which could provide 
whole insight into the interactions between substrate and 
protease (Rao, personal communication). In the first crystal 
structure of SARS-M pl ° determined at pH 6.0, the S1 speci¬ 
ficity pocket of one protomer is in the active conformation 
while that of the neighboring protomer is partially collapsed, 
resulting in only about 50 % protease activity. Activity 
curves show that SARS-CoV protease activity increases 
from pH at 6.0 to approximately physiological pH [50, 53, 
59], and so a pH-triggered switch for the catalytic activity of 
the peptidase was proposed [50]. Tan and colleagues have 
reported similar results by structural analysis and molecular 
dynamics simulations [53]. The crystal structures solved by 
Lee and co-workers from crystals with different space 
groups at pH 6.5 showed that the substrate-binding regions 
of both protomers are in the catalytically competent confor¬ 
mation. Thus the authors proposed the possibility of an al¬ 
ternative or additional mechanism [52]. 

Although CoV M pr ° has similar substrate specificity with 
picomavirus 3C proteases as mentioned above, no reports 
have shown that any CoV M p '° could efficiently process the 
substrate of picomavirus 3C proteases, or vice versa, which 
can be inferred from the differences substrate binding pock¬ 
ets between CoV M plo s and picomavirus 3C proteases. 

It is generally agreed that M pl ° commonly processes the 
downstream cleavage sites in polyproteins in trails, which 
was characterized for M plo s of various species [25]; how¬ 
ever, it remains largely unknown whether the M pro releases 
itself from polyproteins in trans or in cis. In TGEV M pr ° 
structures, it seems reasonable to suggest that the cleavage 
even occurs on an infermolecular basis, which has also been 
suggested for MHV M pr ° based on biochemical evidence 
[75]. However, the previously reported structures of M pr ° are 
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all in the mature form without any clues for the flanking 
hydrophobic regions (TM domains). A number of MHV and 
1BV precursors containing M p ‘° were found to require mi¬ 
crosomal membranes for efficient autocleavage to release 
M p ‘° [69, 76], suggesting that the autocleavage event might 
take place on the membrane and that hydrophobic regions 
help the cleavage sites to approach the active site by inter¬ 
action with the membranes. It is difficult to tell whether 
cleavage in cis will occur or not without further considera¬ 
tion of the interaction between TM domains and mem¬ 
branes. In the N-terminal autocleavage event, for instance, 
no matter whether the autocleavage is in cis or in trails 
within a dimer, domain I will undergo a large conforma¬ 
tional change between pre and post cleavage. Hence, the 
structure of M p ‘° with flanking hydrophobic regions will 
help to address this question. Recently, Hsu et al. reported 
that they found in a dimeric structure of Cl45A mutant of 
SARS-M pl °, the active site of one protomer contains the C- 
terminal of one protomer in another asymmetric unit. There¬ 
fore, they proposed an autocleavage mechanism to explain 
this [51]. 

3. INHIBITOR DESIGN 

3.1. Enzyme Activity Assay 

In the development of CoV M p '° inhibitors, efficacious 
methods are required to measure the enzyme activity and the 
effect of compounds screened. Initially the proteolytic ac¬ 
tivity of M pl ° was characterized mainly through processing 
the expressed polypeptide in vitro and analyzing the prod¬ 
ucts by electrophoresis [48, 49]. Later the HPLC method 
was utilized to analyze the cleavage of M pl ° on synthesized 
peptides [42, 72, 77] and determine the kinetic parameters 
for the proteases [59, 68]. Using an HPLC-based peptide 
cleavage assay. Fan and colleagues synthesized 34 peptide 
substrates and evaluated their specificity for SARS-CoV 
M pl ° by measuring hydrolytic activity [78]. However, these 
two methods are not sensitive and convenient enough for 
large-scale inhibitor screening. Afterwards a continuous 
colorimetirc assay was reported as a substitute for SARS- 
CoV M pr ° activity assay based on cleavage at the Gln-pNA 
bond for a substrate Thr-Ser-Ala-Val-Leu-Gln-pNA [79]. At 
present, a fluorescence resonance energy transfer (FRET) 
technique based on a fluorescence-labeled substrate has 
been developed and is routinely utilized to monitor the M pl ° 
activity in vitro for anti-CoV inhibitor screening and drug 
design [51, 52, 60-63, 73, 80-88]. Parea and co-workers, 
invented a genetic screen assay to monitor the SARS-CoV 
M pl ° activity, in which the protease activity is associated 
with the proliferation of lambda phage in infected Escheri¬ 
chia coli cells [89]. 

3.2. Design of Inhibitors Targeting CoV M pro 

To date, the inhibitors reported for CoV M pl ° have mostly 
been compounds targeting the active site of CoV M pl °. Al¬ 
though inhibitors preventing M p '° dimerization were sug¬ 
gested [27, 64], there have been no related reports. In the 
early stages of CoV M pl ° studies, a series of typical serine, 
cysteine, aspartic acid and metalloprotease inhibitors were 
screened for CoV M plo s. Ziebuhr and colleagues reported 
that 3,4-dichloroisocoumarin, phenylmethylsulfonyl fluoride 


(PMSF), Pefaloc SC, tosyl lysyl chloromethyl ketone 
(TLCK), antipain and ZnCL all function as HCoV 229E 
M pl ° inhibitors [68]. A cysteine protease inhibitor E64d was 
shown to prevent MHV RNA synthesis and virus replica¬ 
tion, possibly by inhibition of MHV M pro processing activity 
[75]. However, these inhibitors were assayed primarily for 
characterization of the CoV M p '° active site and its role in 
viral RNA synthesis and replication, but not for pharmaceu¬ 
tical use due to their non-specificity. 

During the global SARS epidemic, the M pl ° was consid¬ 
ered to be an attractive target [42], which led to an upsurge 
in the development of anti-SARS-CoV M p '° inhibitors. Since 
the structure of SARS-CoV M pl ° was not available at the 
height of the outbreak, a series of homology models were 
constructed from the TGEV M pro and HCoV 229E M pro for 
virtual screening of compounds suitable for binding into the 
substrate binding pocket [42, 90-93]. However, no further 
reports have shown that the compounds suggested above are 
effective in inhibiting SARS-M P '° activity or preventing 
viral replication. In the wake of the SARS outbreak, the 
inhibitors were developed predominantly by three methods: 
(1) large-scale screening of structurally diverse small mole¬ 
cules from compound libraries or natural sources such as 
plant extracts; (2) virtual screening of compounds from 
chemical databases based on the SARS-CoV M pr ° structure; 
and (3) ab initio design of small molecules directly from the 
structure, which will be respectively discussed below. A 
series of representative inhibitors identified against SARS- 
CoV M pro is listed in Table 1. 

3.2.1. High-Throughput Screening 

The rapid, sensitive fluorescence resonance energy trans¬ 
fer (FRET) technique provides the possibility to simultane¬ 
ously screen a large number of compounds from available 
drugs, chemical libraries or natural plant extracts. Except for 
a few cases, most research groups have chosen to adopt this 
method. Hsu and co-workers reported that metal ions such 
as Hg 2+ , Ztr 1 and Cu 2+ are SARS-CoV M pro inhibitors, pos¬ 
sibly because they have the ability to coordinate to the cata¬ 
lytic cysteine in the protease active site [94]. From a com¬ 
pound library, the authors also found that phenylmercuric 
nitrate (see Table 1), thimerosal and hexachlorophene are 
effective inhibitors against SARS-CoV M pro . However, safety 
is the main concern of the metal ions and particularly the 
toxicity of compounds containing mercury. Although pre¬ 
liminary data showed that hexachlorophene can decrease the 
amount of viral spike protein in the Vero E 6 cells infected 
by SARS-CoV below the cytotoxicity concentration, the 
effect of this compound needs to be fully characterized in 
the prevention of viral replication and cytotoxicity. Liu and 
colleagues reported similar results for hexachlorophene and 
its derivatives in enzymatic inhibition assays [ 88 ]. Another 
study by Wu and co-workers reported that a transition-state 
analog inhibitor of HIV protease is active against the SARS- 
CoV M pro (A,=0.6 pM) [82]. In a large scale s creening, Kao 
and colleagues screened 50,240 small molecules, in which 
they identified 104 compounds with anti-SARS-CoV activ¬ 
ity. From these active compounds, the authors discovered 
one compound named MP576 that displays inhibition activ¬ 
ity against SARS-CoV M pl ° with an IC 50 of 2.5 pM and an 
EC 50 of 7 pM in cell-based assays [81]. Similarly, Blanchard 
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Table 1. Inhibition of SARS CoV M pro Activity In Vitro and SARS-CoV in Cell Culture by Published Compounds 



Theaflavin-3,3’- 
digallate (TF3) [95] 


9.5 pM (IC 50 ) 


N/A 






Design of Inhibitors Targeting Coronavirus M" 


Current Pharmaceutical Design, 2006, Vol. 12, No. 35 4581 


(Table 1) contd.... 


Compound 

Structure 

Enzymatic 

assay 

Antiviral activity 

in cell-based 

assay (pM) 

tc 5 » 

(HM) 

Hesperetin [96] 

nT' 

ho tX^ h 

OH O 

60 nM (IC 50 ) 

N/A 

2718 

Calmidazolium [79] 

Cl 

rfi' 

O/- 

Cl 

61 nM (Kd 

N/A 

N/A 

Cinanserin [73] 

0 

4.92 nM (IC 50 ) 

31 (IC 50 ) 

>134 

Aza-peptide 
epoxides [52] 

h 2 n o 

U " -XQ" 

18 nM (Kd 
35xl0 3 S 1 (ks) 

N/A 

N/A 

FL-166 [60] 

HO OH 

B 

HO' ^OH 

0.04 |iM (Ki) 

N/A 

N/A 

8c [80] 

O. 

^^NH 

Bn ° / / y 

AVA'VrV) 

° o HN v^r 

' o no 2 

0.6 nM (IC 50 ) 

N/A 

N/A 
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(Table 1) contd.... 


Compound 

Structure 

Enzymatic 

assay 

Antiviral activity 

in cell-based 

assay (pM) 

TC;« 

(pM) 

2a [87] 

0 \ C1 

■jcAViX 

0.03 pM (K,) 

N/A 

N/A 

4o [84] 

XX^° 

0.95 pM (IC 50 ) 

N/A 

N/A 

N3 [3] 

0 

h 0 h 0 fV r^N 

‘V'yVk 

O = H O \ H O 

r 

9 pM (K,) 
3.1xl0 3 S 1 (k 3 ) 

6 (IC50) 6 

>500 


“ N/A : data not available. 
b Unpublished data. 


and colleagues screened 50,000 small molecules and identi¬ 
fied 5 compounds exhibiting inhibitory activity (ICso= 0.5-7 
pM) towards SARS-CoV M pro [83]. In addition to small 
molecules from chemical libraries, some research groups 
have turned to natural plant extracts such as Chinese herbs. 
Two research groups individually claimed that tannic acid, 
3-isotheaflavin-3-gallate and theaflavin-3,3'-digallate from 
black tea and one plant-derived phenolic compound, hes- 
peretin, are SARS-CoV M pro inhibitors [95, 96], 

3.2.2. Virtual Screening 

In a virtual screening of compounds from several chemi¬ 
cal databases, Liu and colleagues identified calmidazolium, 
an antagonist of calmodulin, as a SARS-CoV M pl ° inhibitor 
with a K t of 61 pM from enzyme inhibition assays [79]. 
Chen and co-workers identified cinanserin, a serotonin an¬ 
tagonist, as an inhibitor of both SARS-CoV and HCoV- 
229E M pro s from a database containing more than 8,000 
compounds by a docking approach [73]. Cinanserin has an 
IC 5 o of about 5 pM against both of these two proteases in 
enzyme inhibition assays, and an EC 50 ranging from 19 to 
34 pM in cell-based assays. The authors also reported that 
the enzyme was not completely inhibited at the maximum 
drug concentration, while the amount of virus was signifi¬ 
cantly reduced in cell culture. This incongruity was ex¬ 
plained as additional drug effects by the authors, but could 
arise if cinanserin acts as an irreversible inhibitor with a 


small inactivation rate constant, and so could not completely 
inactivate the protease in a limited time during the enzyme 
activity assay. 

3.2.3. Ab Initio Inhibitor Design 

Many compounds designed directly from the M pl ° struc¬ 
ture are mechanism-based irreversible inhibitors, which 
could increase their inhibition effects. The inhibitor initially 
forms a reversible complex with the protease, which then 
undergoes a chemical step (nucleophilic attack by Cys) 
leading to the formation of a stable covalent bond. The 
evaluation of this series of inhibitors requires both the equi¬ 
librium-binding constant K t (designated as krfki) and the 
inactivation rate constant for covalent bond formation k, 
(see Equation (1)) [3, 97], 

E + I?==iEI—^->E-I ( 1 ) 

k 2 

The first reported SARS-CoV M pro irreversible inhibitor 
was a substrate-analog chloromethyl ketone (CMK) inhibi¬ 
tor, Cbz-VNSTLQ-CMK [50]. The sequence of this sub¬ 
strate-analog was derived from residues P6-P1 of the N- 
terminal autoprocessing site of TGEV M p '°. However, the 
two protomers of SARS-CoV M pl ° each exhibited an unex¬ 
pected binding mode (see Fig. 4), possibly resulting from 
the comparatively weak binding of peptidyl elements de¬ 
rived from the substrate of TGEV M pl ° and from the highly 
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reactive electrophile CMK. This would suggest that nucleo¬ 
philic attack might have occurred before a stable non- 
covalently bound enzyme-inhibitor complex was formed [3]. 
It should be noted that the CMK compound can react with a 
variety of proteases, including cellular proteases, without 
specificity due to their high electrophilicity. Thus, the CMK 
compound is mainly utilized to explore the active sites of 
proteases but is not suitable for direct use in drug develop¬ 
ment. Recently, Lee and colleagues designed an aza-peptide 
epoxide inhibitor [52], Cbz-Leu-Phe-AGln-EP-COOEt and 
solved the co-crystal structure in which the Cysl45 S' atom 
of the protease forms a covalent bond with the epoxide C3 
atom of the inhibitor. As expected, this substrate analog 
binds to the substrate binding pocket of the protease in a 
normal mode, as seen in the TGEV M pr ° complex [42]. 
Bacha and co-workers identified a cluster of serine residues 
near the active site cavity which were susceptible to target¬ 
ing by compounds containing boronic acid. In a series of 
designed bifunctional aryl boronic acid compounds, a com¬ 
pound named FL-166 is effective at inhibiting the protease 
with a K t of 40 nM [60]. Jain and colleagues reported a se¬ 
ries of keto-glutamine analogues with a phthalhydraido 
group at the a-position, whose derivatives are known HAV 
3C inhibitors, could act as reversible inhibitors against 
SARS M pr ° with IC 50 ranging from 0.6 to 70 pM [80]. Shie 
and colleagues reported that an anilide derived from 2- 
chloro-4-nitroaniline, L-phenylalanine and 4-(dimethyl- 
amino) benzoic acid can reversibly inhibit SARS-CoV M p '° 
with a K t of 30 nM [87]. Chen and co-workers reported that 
a series of isatin (2,3-dioxindole) derivatives, which are 
known covalent inhibitors against rhinovirus 3C protease, 
could inhibit SARS-CoV M p '° activity [98]. Unfortunately, 
previous studies have also shown that isatin compounds 
displayed poor antiviral activity [97], which might limit 
their use in drug development against SARS-CoV. In a re¬ 
cent study, Yang and colleagues proposed a strategy for 
preventing infection by existing and possible future emerg¬ 
ing coronaviruses. After analyzing the critical subsites of 
M pl ° substrate binding pocket from representatives of all 
three groups of CoVs M plo s, the authors concluded that all of 
those subsites are highly conserved. A compound (desig¬ 
nated as N3) incorporating a trans-a, [3-unsaturatcd ester 
with the peptidyl portion targeting this conservative region 
could rapidly inactivate multiple M plo s covering all three 
groups of CoVs in vitro, and shows potent antiviral activity 
with extremely low cytotoxicity. A uniform inhibition 
mechanism was elucidated from the structures of M pl °- 
inhibitor complexes from SARS-CoV and TGEV. In the 
SARS-CoV M pro complex with N3 (see Fig. 5), for instance, 
this compound binds to the shallow cleft formed by a por¬ 
tion of the strand ell and a segment of the loop linking do¬ 
mains II and III. The Cp atom of the Michael acceptor forms 
a covalent bond with the S Y atom of 145-Cys. The PI, P2, P4 
and PP groups of the inhibitor insert favorably into thier 
corresponding subsites. It is of interest that this compound 
could also rapidly inactivate the M pr °s of HCoV-NL63 and 
HCoV-HKUl, two recently identified HCoVs associated 
with bronchiolitis and pneumonia. The authors expect that 
further modification of these inhibitors could rapidly lead to 
the discovery of a single agent with clinical potential against 
existing and possible future emerging CoV-related diseases 

[ 3 ]. 


4. OTHER THERAPEUTIC TARGETS IN THE SARS- 
COV 

Much of the focus in structure-based drug design target¬ 
ing the SARS coronavirus has so far been made on the M p '°. 
However, a slew of recent SARS-CoV protein structures 
have opened new avenues towards the design of SARS anti¬ 
viral compounds, as well as for vaccine design. Of particular 
importance is the spike (S) glycoprotein, one of four struc¬ 
tural proteins that are required for viral assembly. The S 
protein is mainly responsible for binding to the host cell and 
for subsequent cell entry by virus-cell membrane fusion. 
Great attention is also being made to identify targets in¬ 
volved in the replication and transcription of the virus, 
which was mediated by machinery composed of so-called 
non-structural proteins. 

4.1. SARS Spike Protein Fusion Core 

The SARS-CoV S protein can be subdivided into an N- 
terminal half (SI) and C-terminal half (S2), but without 
proteolytic cleavage [99]. SI is responsible for binding to 
cellular receptors, while. S2 contains an internal fusion pep¬ 
tide and has two hydrophobic (heptad) repeat regions desig¬ 
nated HR1 and HR2 [100]. 

The structure of the spike (S) protein fusion core con¬ 
sisting of HR1 and HR2 regions was first determined in 
2004 by two groups in the post-fusion (or fusion-active) 
state (Fig. 6) [99, 101]. Briefly, the structure of the S fusion 
core structure is a six-helix bundle in which three HR1 heli¬ 
ces form a central coiled-coil surrounded by three HR2 heli¬ 
ces in an oblique, anti-parallel manner. HR2 peptides pack 
into the hydrophobic grooves of the HR1 trirner in a mixed 
extended and helical conformation, representing a stable 
post-fusion structure, similar to that observed for HIV-1 
gp41 L102]. 

The classical mechanism of enveloped virus and host¬ 
cell membrane fusion mediated by class I fusion proteins 
was first established by Don Wiley and colleagues from 
comprehensive studies of the influenza hemagglutinin (HA) 
[102, 103]. In subsequent years, extensive structural studies 
on the orthomyxovirus, retrovirus, paramyxovirus, and 
filovirus families have yielded a common fusion mechanism 
[102]. To confirm the value of fusion proteins as antiviral 
targets, one HIV-1 membrane fusion-inhibitory peptide T-20 
(Trimeris. Research Triangle Park, NC, USA), targeting the 
prehairpin intermediate, was recently approved by the US 
Food and Drug Administration as a new anti-HIV drug 
[104]. In the case of SARS-CoV, several peptides derived 
from the HR1 and HR2 regions of SARS-CoV spike pro¬ 
teins have been synthesized to block viral entry targeting the 
putative pre-hairpin intermediate [104-106]. Importantly, 
two groups have found that only peptides derived from HR2, 
and not from HR1, inhibited SARS-CoV infection [104, 
105]. Moreover, the efficacy of HR2 peptides derived from 
SARS-CoV spike protein is lower than those of correspond¬ 
ing HR2 peptides of murine coronavirus mouse hepatitis 
virus in inhibiting MHV infection [105]. One possibility 
might be the lower affinity of these peptides for the corre¬ 
sponding HR1 trimer [105], since a larger surface area is 
buried in the HR1-HR2 interface of MHV S2 than in SARS- 
CoV S2 [99]. In any case, elucidation of the HR1-HR2 
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Fig. (5). A). Surface representation of SARS-CoV M pro (blue, with residues shown in yellow) complexed with the N3 inhibitor (green) re¬ 
ported by Yang et al. [3], Water molecules are shown as red spheres. The PI. P2, P3, P4, P5 and PI' groups and residues forming the sub¬ 
strate binding pocket are labeled. B). Detailed view of the interactions between SARS-CoV M pro and the N3 inhibitor reported by Yang et al. 
[3]. The N3 inhibitor is shown in blue. Hydrogen bonds are shown as dashed lines and interaction distances are given. The covalent bond is 
labeled in red. 
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fusion core structure should prove beneficial for the discov¬ 
ery of viral entry inhibitors against SARS. 

4.2. SARS-CoV Spike Protein Receptor Binding Domain 
in Complex with the Cellular Receptor ACE2 

As previously described, the SI region within the spike 
protein features the receptor binding site while the S2 region 
mediates the fusion activity. One potential SARS-CoV re¬ 
ceptor has been identified as angiotensin-converting enzyme 
2 (ACE2) [107], and a fragment of the SI region has been 
shown to be required for tight binding to the peptidase do¬ 
main of ACE2 [108-110], It is this fragment, termed the 
receptor binding domain (RBD), which mediates the virus- 
receptor interaction and thus determines the viral host range 
and tropism. Changes in just a few residues in the RBD can 
result in efficient cross-species transmission [111, 112], 
Furthermore, the RBD also includes important viral-neutral¬ 
izing epitopes [113-116], suggesting that variants of the 
RBD could be used for the development of protein-based 
vaccines. The recent structure of the RBD in complex with 
the cellular receptor ACE2, determined by Stephen Harrison 
and colleagues [117], should therefore provide an important 
means for the design of effective coronavirus vaccines. 
From the complex structure, the authors observed tight 
complementarity between the shallow concave surface of 
the S protein RBD and one lobe of the human ACE2 pepti¬ 
dase domain (Fig. 6) and identified residues critical residues 
for species specificity. 

4.3. Non-Structural or Replicase Proteins 

The SARS-CoV replicase gene encodes 16 non-struc- 
tural proteins (nsps) with multiple enzymatic functions, 
which are known or are predicted to include types of en¬ 
zymes that are common components of the replication ma¬ 
chinery of plus-strand RNA viruses (see [25] for a review). 
A number of recent structures of replicase proteins, both 
alone and in protein-protein complexes, provide insights into 
the sophisticated function and assembly of the replica¬ 
tion/transcription machinery. Targeting and inhibiting the 
function of this machinery should be another important fo¬ 
cus of structure-based drug design. However, little remains 
known about the coronavirus replication/transcription ma¬ 
chinery, particularly the interactions between the various 
non-structural protein components, and further work is 
needed to identify viable targets. 

Nsp9, the crystal structures of which were determined 
simultaneously in 2004 by two groups [118, 119], have as¬ 
certained its previously unknown function as a dimeric sin¬ 
gle-stranded RNA binding protein (Fig. 6). In addition to its 
structure, nsp9 has been shown to interact with nsp8, while 
dual-labeling studies of SARS-CoV replicase proteins have 
demonstrated co-localization of nsp8 with nsp2 and nsp3. 
Zihe Rao and colleagues have determined the first crystal 
structure of a complex between two non-structural proteins, 
nsp7 and nsp8 (Fig. 6) [120]. Eight copies of nsp7 and eight 
copies of nsp8 together form an intricate scaffold that re¬ 
sembles a hollow cylinder. The inner dimensions and elec¬ 
trostatic properties of the cylinder suggest that it should 
encircle nucleic acid, and an interaction was demonstrated 
with dsRNA by EMSA and mutagenesis. The architecture 


and electrostatic properties are reminiscent of PCNA or the 
(3-subunit ring, the processivity factors of DNA polymerase, 
suggesting that the nsp7-nsp8 complex should be a proces¬ 
sivity factor for the RNA-dependent RNA polymerase 
(nspl2). Finally, the crystal structure of the ADRP domain 
of nsp3 has also been determined L121 ]. 

4.4. Other Targets for Anti-SARS Drug and Vaccine 
Design 

Other targets for anti-viral therapy and vaccine design 
have been identified in the SARS-CoV by structural biol¬ 
ogy. For instance, antigenic peptides of the coronavirus 
nucleocapsid (N) protein can be recognized on the surface of 
infected cells by T cells [122, 123]. The structure of the 
MHC-I molecule HLA-A*1101 in complex with such a 
peptide derived from the SARS-CoV N protein, a nonamer 
with SARS specific sequence, could be used as a template 
for peptide-based vaccine design [124]. DNA vaccines tar¬ 
geting the SARS-CoV nucleocapsid have been generated by 
two different groups [21, 23]. 

The crystal structure of the rigorously conserved stem- 
loop II motif (s2m) RNA element from SARS-CoV was 
determined earlier this year L125]. The unusual structural 
features form likely surfaces for interaction with conserved 
host cell components or other reactive sites required for 
virus function. This, together with the high sequence con¬ 
servation of s2m in an otherwise rapidly mutable RNA ge¬ 
nome, implies its pathogenic importance and signals that it 
could be another attractive target for the design of antiviral 
therapeutics. 

5. CONCLUSIONS 

Prior to the global SARS outbreak in 2003, Co Vs had 
not attracted enough attention from researchers because this 
genus of viruses causes severe diseases dominantly in ani¬ 
mals and only comparatively mild diseases in humans. The 
SARS epidemic led to extensive studies of this etiological 
agent, the pathogenesis of SARS, and vaccine and drug 
development. The CoV M pl °, spike protein, helicase, RNA- 
dependent RNA polymerase (RdRp) and possibly other viral 
(or cellular) proteins involved in pathogenesis are consid¬ 
ered to be targets for drug design. Among them, the M p ‘° has 
been characterized the most in terms of structure and bio¬ 
chemistry. Despite this, there are several open questions: (1) 
How does the M pro release itself from the replicase polypro¬ 
teins, in cis or in transl (2) Do alternative proteolytic proc¬ 
essing pathways exist for the replicase polyproteins in 
CoVs, as in equine arteritis virus? If so, how does the M p '° 
mediate the pathways? (3) Does the M pl ° involve in patho¬ 
genesis in CoV-associated diseases? To answer these ques¬ 
tions requires further structural and biochemical studies. 

Evidence suggests that Co Vs may have completed at 
least two animal-to-human interspecies transmissions to date 
[5, 6, 12, 31], resulting in the emergence of HCoV 229E and 
SARS-CoV. Although SARS has been contained, the dis¬ 
covery of the natural host for SARS-like CoV has aroused 
new fears for public health. Co Vs, especially those that can 
infect hosts such as domestic animals and pets in frequent 
contact with humans, remain a potential threat to human 
health assuming they cross the interspecies barrier again. 
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Fig. (6). A). The S protein fusion core “six-helix bundle” structures from Supekar et al. (left, [99]) and Xu et al. (right, [101]). The central 
HR1 peptides are shown in ribbon representation and colored red, blue and green. The HR2 peptides are shown in black. The N- and C- ter¬ 
mini are labeled. B). The SARS-CoV S protein receptor binding domain (RBD) in complex with the cellular receptor ACE2 [117]. The com¬ 
plex structure of ACE2 (in green) and the S protein RBD (core structure in cyan and the receptor binding motif (RBM) in red) is shown in 
ribbon representation and covered by a transparent surface. C). The dimer structure of nsp9, a single stranded RNA binding protein [118, 
119]. Each monomer is shown in ribbon representation and colored from blue at the N-terminal to red at the C-terminal. D). The structure of 
the nsp7-nsp8 hexadecameric complex. The complex structure is shown in ribbon representation and covered by a transparent surface. Nsp7 
is shown in green and the two conformations of nsp8 are shown in blue (I) and orange (II). 


Several key factors controlling the host spectrum and viral 
pathogenicity are highly variable among Co Vs, including 
the requirement of different host receptors for cellular entry, 
poorly conserved structural proteins (antigens), and diverse 
accessory genes in their 3'-terminal genome regions that 
probably contribute to the pathogenicity of CoVs in specific 
hosts [2, 8, 28, 29, 107, 126-128]. This structural and func¬ 


tional diversity presents a significant obstacle for the design 
of a versatile compound against all CoVs. For instance, a 
fusion peptide inhibitor derived from the MHV spike protein 
can not prevent SARS-CoV replication in cell culture [105]. 
Identification of the CoV M pl ° as a conserved target among 
all Co Vs will provide an opportunity for the development of 
broad-spectrum inhibitors against all CoV-related diseases. 



Design of Inhibitors Targeting Coronavirus M ro 

The development of inhibitors targeting SARS-CoV M pro 
has produced a number of positive results. Some compounds 
exhibit inhibitory effects on both the protease and antiviral 
activity. A series of inhibitors whose backbone incorporates 
a trans-a, fTunsaturated ester with a peptidyl portion seem 
particularly promising, and one of their analogues, ruprin- 
trivir, has entered clinical trials against rhinovirus infection 
[97J. Further modification of these compounds should be 
expected to rapidly lead to the discovery of a single agent 
with clinical potential against existing and possible future 
emerging CoV-related diseases. 
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